DETAILED ACTION

The amendment and response filed 7/23/2007 are acknowledged. Claim 34 has 
been amended. Claims 6-9, 26, 27, 32-34 are pending and under current examination. 

Unless otherwise indicated, previous rejections that have been rendered moot in
view of the amendment to the pending claims or persuasive arguments will not be
reiterated. The arguments in the 7/23/07 response are addressed to the extent that
they apply to the current rejections.



Claim Rejections - 35 USC §112

The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall
set forth the best mode contemplated by the inventor of carrying out his invention.

Claims 6, 9 stand rejected under 35 U.S.C. 112, first paragraph, as failing to
comply with the written description requirement. The claim(s) contains subject matter 
which was not described in the specification in such a way as to reasonably convey to 
one skilled in the relevant art that the inventor(s), at the time the application was filed, 
had possession of the claimed invention. 

In the remarks, the applicant first asserts, 

In claim 6, the nucleic acid sequences encode peptides comprising an
epitope of the VCA-p18 or VCA-p40 protein.

In response, it is noted that, in claim 6 as written, the nucleic acid sequence encodes a
peptide that is immunochemically reactive with antibodies to VCA-p18 or VCA-p40.
Hence, the genus embraced by the claimed nucleic acids may encompass coding
sequences for peptides comprising an epitope of VCA-p18 or VCA-p40, but it is
not limited to such coding sequences; rather, the genus encompasses nucleic acid
sequences encoding any protein, as long as it is immunochemically reactive with
antibodies to VCA-p18 or VCA-p40, including proteins having no clear relationship to
VCA proteins. Claim 6 contains multiple phrases further limiting the antibodies, but not
further limiting the peptide itself. The only limitation on the peptide is its immunochemical
reactivity with the antibodies. One cannot extrapolate the structures of the protein genus
from the immune reactivity.

Applicants then argue that the antibodies are further defined by particular
hybridomas, and that one of skill in the art further recognizes that monoclonal antibodies are
specific to a single epitope and that all of the antibodies produced from a single hybridoma
are identical.

In response, it is noted that the antibodies are not limited by the recited 
hybridomas, but rather limited by the immunochemical reactivity: "are antibodies having 
the same reactivity with VCA-p18 as antibodies produced by the hybridoma...". 

Even assuming arguendo that the claims are directed to peptides reactive with the
antibodies produced by the recited hybridomas, the peptides encompass a genus of
amino acid sequences whose nucleic acid coding sequences are limited only by the
fact that they contain a nucleotide sequence whose expression product has a certain immune
reactivity. Moreover, since there is no length limitation on the nucleic acid sequences, they
could be a genomic sequence, a fusion protein sequence, or any other known or
unknown nucleic acid sequence that happens to encode a protein reactive to the
antibodies or to contain sequences sharing homology with SEQ ID Nos: 1 and/or 3. Thus,
even if the antibodies produced by the hybridomas have the same reactivity with VCA-p18
or VCA-p40, the proteins reactive to the antibodies may be structurally distinct. This
can be seen in the example of the specification, where serum proteins reactive with
applicant's antibody have different domain combinations, i.e., are structurally different (see,
e.g., Table I of the specification).

Applicants then argue that the specification demonstrates actual reduction to 
practice of the nucleic acids of claim 6 as exemplified by peptides represented by 
various SEQ ID Nos: 2, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 
22. 

In response, it is noted that the majority of the numbered peptides are only partial
sequences of a protein, each of which has an amino acid sequence distinct from the others,
yet they all demonstrated immune reactivity with a VCA-p18 antibody. Among the 15
proteins listed in Table I, only the proteins from serum Nos. 9 and 10 have the same
combination of domains I, II, and III, and even then it is unclear whether they have the same
protein sequence in the remaining portion of the immune-reactive protein. One skilled
in the art could not extrapolate the full-length protein sequences from the disclosure of
the partial sequences and their immune reactivity. Thus, the working examples of the
specification, especially Table I, further support the Office's position that the specification fails
to provide an adequate description for the genus of the nucleic acid sequences
encompassed by the claims.

An adequate written description for a genus of nucleic acid sequences
encompassed by the instant claims requires more than a mere statement that they are part
of the invention; what is required is a description of the sequences themselves. The
court has made it very clear: "Conception of chemical compound requires that inventor
be able to define compound so as to distinguish it from other materials, and to describe
how to obtain it, rather than simply defining it solely by its principal biological
activity". Amgen Inc. v. Chugai Pharmaceutical Co. Ltd., 18 USPQ2d 1016 (Fed. Cir.
1991).

The Revised Interim Guidelines state "The claimed invention as a whole may not
be adequately described if the claims require an essential or critical element which is
not adequately described in the specification and which is not conventional in the art"
(Column 3, page 71434), "when there is substantial variation within the genus, one
must describe a sufficient variety of species to reflect the variation within the genus",
"in an unpredictable art, adequate written description of a genus which embraces
widely variant species cannot be achieved by disclosing only one species within the
genus" (Column 2, page 71436).

Vas-Cath Inc. v. Mahurkar, 19 USPQ2d 1111, clearly states that "applicant must
convey with reasonable clarity to those skilled in the art that, as of the filing date
sought, he or she was in possession of the invention. The invention is, for purposes of
the 'written description' inquiry, whatever is now claimed." (See page 1117.) The
specification does not "clearly allow persons of ordinary skill in the art to recognize that
[he or she] invented what is claimed." (See Vas-Cath at page 1116).

One cannot describe what one has not conceived. See Fiddes v. Baird, 30
USPQ2d 1481, 1483. In Fiddes, claims directed to mammalian FGFs were found to be
unpatentable due to lack of written description for that broad class. The specification
provided only the bovine sequence.

In view of these considerations, a skilled artisan would not have viewed the
teachings of the specification as sufficient to show that the applicant was in possession
of the claimed invention commensurate with its scope, because the specification does not provide
an adequate written description for the genus of sequences encompassed by the claims.
Therefore, for reasons of record and set forth supra, the rejection stands.

Claims 6, 9 stand rejected under 35 U.S.C. 112, first paragraph, as failing to 
comply with the enablement requirement. The claim(s) contains subject matter which 
was not described in the specification in such a way as to enable one skilled in the art to 
which it pertains, or with which it is most nearly connected, to make and/or use the 
invention, for reasons of record and set forth supra. 

Claim Rejections - 35 USC §102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless -

(b) the invention was patented or described in a printed publication in this or a foreign country or in public
use or on sale in this country, more than one year prior to the date of application for patent in the United
States.

Claims 6-9, 26, 27, 32-34 stand rejected under 35 U.S.C. 102(b) as being 
anticipated by Laux et al. (The EMBO J 1988;7:769-74). 

Laux et al. teaches a nucleic acid sequence comprising a subsequence of instant
SEQ ID No: 1 (residues 1-535 of SEQ ID No: 1), which encodes at least 12 contiguous
amino acids of EBV VCA-p18 (the amino acid sequence of SEQ ID No: 5), a peptide that would be
immunochemically reactive with antibodies to EBV VCA-p18. Laux et al. also
teaches a nucleic acid sequence comprising instant SEQ ID No: 3, which encodes 12
contiguous amino acids of EBV VCA-p40. Laux et al. discloses a vector comprising
the sequences (e.g., Fig. 1). Accordingly, Laux et al. anticipates the instant claims.

In the remarks, the applicant asserts that multiple alignments were performed 
and homology as stated in the Office action was not found. 

In response, the printed copy of the database search result is hereby enclosed, 
which contains the details of the alignment. 

Claims 6-9, 26, 27, 32-34 stand rejected under 35 U.S.C. 102(b) as being
anticipated by Bankier et al. (Mol Biol Med 1983;1:425-445).

Bankier et al. teaches a nucleic acid sequence comprising a subsequence of
instant SEQ ID No: 1 (residues 1-535 of SEQ ID No: 1), which encodes at least 12
contiguous amino acids of EBV VCA-p18 (the amino acid sequence of SEQ ID No: 5), a peptide that
would be immunochemically reactive with antibodies to EBV VCA-p18. Bankier et
al. also teaches a nucleic acid sequence comprising a sequence that shares 98.8%
homology with instant SEQ ID No: 3 (a subsequence thereof), which encodes 12
contiguous amino acids of EBV VCA-p40. Bankier et al. discloses that the sequences
were cloned in a vector (e.g., Figs. 1-3). Accordingly, Bankier et al. anticipates the instant
claims.

In the remarks, the applicant asserts that multiple alignments were performed 
and homology as stated in the Office action was not found. 

In response, the printed copy of the database search result is hereby enclosed, 
which contains details of the alignment. 



No claim is allowed. 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time
policy as set forth in 37 CFR 1.136(a).

A shortened statutory period for reply to this final action is set to expire THREE
MONTHS from the mailing date of this action. In the event a first reply is filed within
TWO MONTHS of the mailing date of this final action and the advisory action is not
mailed until after the end of the THREE-MONTH shortened statutory period, then the
shortened statutory period will expire on the date the advisory action is mailed, and any
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of
the advisory action. In no event, however, will the statutory period for reply expire later
than SIX MONTHS from the mailing date of this final action.

Any inquiry concerning this communication or earlier communications from the
examiner should be directed to Q. Janice Li whose telephone number is 571-272-0730.
The examiner can normally be reached on 9:30 a.m. - 7 p.m., Monday through Friday,
except every other Wednesday.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's
supervisor, Joseph Woitach, can be reached on 571-272-0739. The fax number for
the organization where this application or proceeding is assigned is 571-273-8300.

Any inquiry of a general nature or relating to the status of this application or 
proceeding should be directed to (571) 272-0547. For all other customer support, 
please call the USPTO Call Center (UCC) at 800-786-9199. 




Q. JANICE LI, M.D.
PRIMARY EXAMINER



QJL 

October 10, 2007 



SCORE Search Results Details for Application 10036729 and Search Result SitemName. Page 6 of 16

Qy        661 CACTCGGGGCCTTACGGATTTCAGCCTCATCAAAGCTACGAAGTGCCCAGATACGTCCCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     137078 CACTCGGGGCCTTACGGATTTCAGCCTCATCAAAGCTACGAAGTGCCCAGATACGTCCCT 137137

Qy        721 CATCCGCCCCCACCACCAACTTCTCACCAGGCAGCTCAGGCGCAGCCTCCACCCCCGGGC 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     137138 CATCCGCCCCCACCACCAACTTCTCACCAGGCAGCTCAGGCGCAGCCTCCACCCCCGGGC 137197

Qy        781 ACACAGGCCCCCGAAGCCCACTGTGTGGCCGAGTCCACGATCCCTGAGGCGGGAGCAGCC 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     137198 ACACAGGCCCCCGAAGCCCACTGTGTGGCCGAGTCCACGATCCCTGAGGCGGGAGCAGCC 137257

Qy        841 GGGAACTCTGGACCCCGGGAGGACACCAACCCTCAGCAGCCCACCACCGAGGGCCACCAC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     137258 GGGAACTCTGGACCCCGGGAGGACACCAACCCTCAGCAGCCCACCACCGAGGGCCACCAC 137317

Qy        901 CGCGGAAAGAAACTGGTGCAGGCCTCTGCGTCCGGAGTGGCTCAGTCTAAGGAGCCCACC 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     137318 CGCGGAAAGAAACTGGTGCAGGCCTCTGCGTCCGGAGTGGCTCAGTCTAAGGAGCCCACC 137377

Qy        961 ACCCCCAAGGCCAAGTCTGTGTCAGCCCACCTCAAGTCCATCTTTTGCGAGGAATTGCTG 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     137378 ACCCCCAAGGCCAAGTCTGTGTCAGCCCACCTCAAGTCCATCTTTTGCGAGGAATTGCTG 137437

Qy       1021 AATAAACGCGTGGCTTGA 1038

RESULT 7

EBV
LOCUS       EBV                   172281 bp    DNA     circular VRL 18-APR-2005
DEFINITION  Epstein-Barr virus (EBV) genome, strain B95-8.
ACCESSION   V01555 J02070 K01729 K01730 V01554 X00498 X00499 X00784
VERSION     V01555.1  GI:59074
KEYWORDS    DNA polymerase; EBNA; genome; ribonucleotide reductase; tandem
            repeat; terminal repeat.
SOURCE      Human herpesvirus 4 (Epstein-Barr virus)
  ORGANISM  Human herpesvirus 4
            Viruses; dsDNA viruses, no RNA stage; Herpesviridae;
            Gammaherpesvirinae; Lymphocryptovirus.
REFERENCE   1
  AUTHORS   Arrand,J.R., Rymo,L., Walsh,J.E., Bjorck,E., Lindahl,T. and
            Griffin,B.E.
  TITLE     Molecular cloning of the complete Epstein-Barr virus genome as a
            set of overlapping restriction endonuclease fragments
  JOURNAL   Nucleic Acids Res. 9 (13), 2999-3014 (1981)
   PUBMED   6269068
REFERENCE   2
  AUTHORS   Kozak,M.
  TITLE     Possible role of flanking nucleotides in recognition of the AUG
            initiator codon by eukaryotic ribosomes
  JOURNAL   Nucleic Acids Res. 9 (20), 5233-5252 (1981)
   PUBMED   7301588
REFERENCE   3
  AUTHORS   Deininger,P.L., Bankier,A., Farrell,P., Baer,R. and Barrell,B.
  TITLE     Sequence analysis and in vitro transcription of portions of the
            Epstein-Barr virus genome
  JOURNAL   J. Cell. Biochem. 19 (3), 267-274 (1982)
   PUBMED   6296170
REFERENCE   4
  AUTHORS   Farrell,P.J., Bankier,A., Seguin,C., Deininger,P. and Barrell,B.G.
  TITLE     Latent and lytic cycle promoters of Epstein-Barr virus
  JOURNAL   EMBO J. 2 (8), 1331-1338 (1983)
   PUBMED   10872327
REFERENCE   5
  AUTHORS   Farrell,P.J., Deininger,P.L., Bankier,A. and Barrell,B.
  TITLE     Homologous upstream sequences near Epstein-Barr virus promoters
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80 (6), 1565-1569 (1983)
   PUBMED   6300857
REFERENCE   6  (bases 142687 to 159853)
  AUTHORS   Bankier,A.T., Deininger,P.L., Farrell,P.J. and Barrell,B.G.
  TITLE     Sequence analysis of the 17,166 base-pair EcoRI fragment C of B95-8
            Epstein-Barr virus
  JOURNAL   Mol. Biol. Med. 1 (1), 21-45 (1983)
   PUBMED   6092825
REFERENCE   7  (bases 112620 to 125316)
  AUTHORS   Seguin,C., Farrell,P.J. and Barrell,B.G.
  TITLE     DNA sequence and transcription of the BamHI fragment B region of
            B95-8 Epstein-Barr virus
  JOURNAL   Mol. Biol. Med. 1 (3), 369-392 (1983)
   PUBMED   6094953
REFERENCE   8  (bases 45644 to 52450)
  AUTHORS   Jeang,K.T. and Hayward,S.D.
  TITLE     Organization of the Epstein-Barr virus DNA molecule. III. Location
            of the P3HR-1 deletion junction and characterization of the NotI
            repeat units that form part of the template for an abundant
            12-O-tetradecanoylphorbol-13-acetate-induced mRNA transcript
  JOURNAL   J. Virol. 48 (1), 135-148 (1983)
   PUBMED   6310141
REFERENCE   9  (bases 159853 to 172281)
  AUTHORS   Bankier,A.T., Deininger,P.L., Satchwell,S.C., Baer,R., Farrell,P.J.
            and Barrell,B.G.
  TITLE     DNA sequence analysis of the EcoRI Dhet fragment of B95-8
            Epstein-Barr virus containing the terminal repeat sequences
  JOURNAL   Mol. Biol. Med. 1 (4), 425-445 (1983)
   PUBMED   6094955
REFERENCE   10 (bases 45415 to 52824)
  AUTHORS   Jones,M.D., Foster,L., Sheedy,T. and Griffin,B.E.
  TITLE     The EB virus genome in Daudi Burkitt's lymphoma cells has a
            deletion similar to that observed in a non-transforming strain
            (P3HR-1) of the virus
  JOURNAL   EMBO J. 3 (4), 813-821 (1984)
   PUBMED   6327290
REFERENCE   11 (bases 87650 to 92703)
  AUTHORS   Biggin,M., Farrell,P.J. and Barrell,B.G.
  TITLE     Transcription and DNA sequence of the BamHI L fragment of B95-8
            Epstein-Barr virus
  JOURNAL   EMBO J. 3 (5), 1083-1090 (1984)
   PUBMED   6203743
REFERENCE   12 (bases 7315 to 9312)
  AUTHORS   Yates,J., Warren,N., Reisman,D. and Sugden,B.
  TITLE     A cis-acting element from the Epstein-Barr viral genome that
            permits stable replication of recombinant plasmids in latently
            infected cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81 (12), 3806-3810 (1984)
   PUBMED   6328526
REFERENCE   13 (bases 76089 to 79808)
  AUTHORS   Gibson,T., Stockwell,P., Ginsburg,M. and Barrell,B.
  TITLE     Homology between two EBV early genes and HSV ribonucleotide
            reductase and 38K genes
  JOURNAL   Nucleic Acids Res. 12 (12), 5087-5099 (1984)
   PUBMED   6330697
REFERENCE   14 (bases 1 to 172281)
  AUTHORS   Baer,R., Bankier,A.T., Biggin,M.D., Deininger,P.L., Farrell,P.J.,
            Gibson,T.J., Hatfull,G., Hudson,G.S., Satchwell,S.C., Seguin,C.,
            Tuffnell,P.S. and Barrell,B.G.
  TITLE     DNA sequence and expression of the B95-8 Epstein-Barr virus genome
  JOURNAL   Nature 310 (5974), 207-211 (1984)
   PUBMED   6087149
REFERENCE   15
  AUTHORS   Bodescot,M. and Perricaudet,M.
  TITLE     Clustered alternative splice sites in Epstein-Barr virus RNAs
  JOURNAL   Nucleic Acids Res. 15 (14), 5887 (1987)
   PUBMED   3039467
REFERENCE   16
  AUTHORS   Laux,G., Perricaudet,M. and Farrell,P.J.
  TITLE     A spliced Epstein-Barr virus gene expressed in immortalized
            lymphocytes is created by circularization of the linear viral
            genome
  JOURNAL   EMBO J. 7 (3), 769-774 (1988)
   PUBMED   2840285
REFERENCE   17
  AUTHORS   Hatfull,G.F., Barrell,B.G., Quinn,J. and McGeoch,D.
  JOURNAL   Unpublished
REFERENCE   18 (bases 1 to 172281)
  AUTHORS   Farrell,P.J. and Barrell,B.G.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JUN-1984)
REFERENCE   19 (bases 1 to 172281)
  AUTHORS   Farrell,P.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-MAR-1988) Farrell P., Ludwig Institute for Cancer
            Research, St. Mary's Hospital Medical School, Norfolk Place London
            W2 IPG
COMMENT     On or before Apr 23, 2004 this sequence version replaced gi:330432,
            gi:330357, gi:330413.
            CDS
            Listed under this feature are all known protein coding regions as
            well as all the major open reading frames in the sequence. In
            general the term major is taken as the longest frame in a
            particular region taking into account the adjacent longest frames
            and likely transcription signals. Note that on this basis some long
            overlapping frames have been excluded and on the other hand some
            small frames have been included which might represent exons or
            genes because they occur in a logical combination with other
            features or because of some other experimental data. The reading
            frames are named according to the Bam HI fragment in which they
            start, eg BALF3 is the third leftward frame starting in Bam HI
            fragment A. BORF1 is the first rightward frame in Bam HI fragment
            O. If there is an obvious TATA sequence followed by an in frame Met
            codon that satisfies the rules of Kozak [12] in that there is a
            purine at -3 and/or a G at +4 then the reading frame is numbered
            from the A of the ATG to the base preceding the termination codon.
            If there is no obvious initiation codon or there is a substantial
            reading frame in phase before the ATG then the reading frame is
            numbered from the first base of the first codon.
            SITEs of POLYA signals
            This feature lists all occurences of the sequence AATAAA which is
            found normally approximately 20 bases upstream of the mRNA
            processing/polyA addition site. The rarely used homolog ATTAAA is
            only listed when it is found in a position close to the end of a
            major reading frame.
            SITEs of DONOR and ACCEPT sequences
            This is not a comprehensive listing of all such sequences and only
            the positions of a few have been noted because they occur in
            potentially interesting positions. The number quoted in the table
            is the position of the terminal base in the intron in each case.
            Restriction enzyme SITEs.
            Only the positions of the sites Bam HI (BAM) are listed.
            RPT
            This feature is used to define repetitive sequences.
            SITE DEL
            This feature defines deletions in B95-8 with respect to other
            strains such as RAJI and also to deletions in other strains such as
            P3HR1 and DAUDI with respect to B95-8.
            SITE HPN
            Denotes sequences with twofold symmetry ie could form hairpin
            loops. This is not a comprehensive list - only a few occurences
            noted.
            ORGRPL
            Denotes the region that encompasses an origin of replication (ori
            P).[13].
            NUMBERING
            The DNA sequence of B95-8 EBV has been revised [19]. The original
            (Baer et al, 1984) base 359 has been deleted so the new sequence
            around that position reads TCAGTCTTT. To avoid renumbering the
            entire sequence, position 1 has benn moved 1 base to the left of
            the EcoRI site separating EcoRI Dhet from EcoRI I
            (ie the first A of AGAATTC).
FEATURES             Location/Qualifiers
     source          1..172281
                     /organism="Human herpesvirus 4"
                     /mol_type="genomic DNA"
                     /strain="B95-8"
                     /db_xref="taxon:10376"
     mRNA            58..272
                     /product="exon 2 terminal protein RNA"
     mRNA            360..458
                     /product="exon 3 terminal protein RNA"
     misc_feature    complement(535)
                     /note="polyA signal: AATAAA"
     mRNA            540..788
                     /product="exon 4 terminal protein RNA"
     mRNA            871..951
                     /product="exon 5 terminal protein RNA"
     mRNA            1026..1196
                     /product="exon 6 terminal protein RNA"
     promoter        complement(1192)
                     /note="TATA: TATAAAT"
     mRNA            1280..1495
                     /product="exon 7 terminal protein RNA"
     promoter        complement(1383)

  Query Match             100.0%;  Score 1038;  DB 13;  Length 172281;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 1038;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy          1 ATGCTATCAGGTAACGCAGGAGAAGGAGCAACAGCCTGCGGAGGTTCGGCCGCCGCGGGC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     148707 ATGCTATCAGGTAACGCAGGAGAAGGAGCAACAGCCTGCGGAGGTTCGGCCGCCGCGGGC 148766

Qy         61 CAGGACCTCATCAGCGTCCCCCGCAACACCTTTATGACACTGCTTCAGACCAACCTGGAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     148767 CAGGACCTCATCAGCGTCCCCCGCAACACCTTTATGACACTGCTTCAGACCAACCTGGAC 148826

Qy        121 AACAAACCGCCGAGGCAGACCCCGCTACCCTACGCGGCCCCGCTGCCCCCCTTTTCCCAC 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     148827 AACAAACCGCCGAGGCAGACCCCGCTACCCTACGCGGCCCCGCTGCCCCCCTTTTCCCAC 148886

Qy        181 CAGGCAATAGCCACCGCGCCTTCCTACGGTCCTGGGGCCGGAGCGGTCGCCCCGGCCGGC 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     148887 CAGGCAATAGCCACCGCGCCTTCCTACGGTCCTGGGGCCGGAGCGGTCGCCCCGGCCGGC 148946

Qy        241 GGCTACTTTACCTCCCCAGGAGGTTACTACGCCGGGCCCGCGGGCGGGGACCCGGGTGCC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     148947 GGCTACTTTACCTCCCCAGGAGGTTACTACGCCGGGCCCGCGGGCGGGGACCCGGGTGCC 149006

Qy        301 TTCTTGGCGATGGACGCTCACACCTACCACCCCCACCCACACCCCCCTCCGGCCTACTTT 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149007 TTCTTGGCGATGGACGCTCACACCTACCACCCCCACCCACACCCCCCTCCGGCCTACTTT 149066

Qy        361 GGCTTGCCGGGCCTCTTTGGCCCCCCTCCACCCGTGCCTCCTTACTACGGATCCCACTTG 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149067 GGCTTGCCGGGCCTCTTTGGCCCCCCTCCACCCGTGCCTCCTTACTACGGATCCCACTTG 149126

Qy        421 CGGGCAGACTACGTCCCCGCTCCCTCGCGATCCAACAAGCGGAAAAGAGACCCCGAGGAG 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149127 CGGGCAGACTACGTGCCCGCTCCCTCGCGATCCAACAAGCGGAAAAGAGACCCCGAGGAG 149186

Qy        481 GATGAAGAAGGCGGGGGGCTATTCCCGGGGGAGGACGCCACCCTCTACCGCAAGGACATA 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149187 GATGAAGAAGGCGGGGGGCTATTCCCGGGGGAGGACGCCACCCTCTACCGCAAGGACATA 149246

Qy        541 GCGGGCCTCTCCAAGAGTGTGAATGAGTTACAGCACACGCTACAGGCCCTGCGCCGGGAG 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149247 GCGGGCCTCTCCAAGAGTGTGAATGAGTTACAGCACACGCTACAGGCCCTGCGCCGGGAG 149306

Qy        601 ACGCTGTCCTACGGCCACACCGGAGTCGGATACTGCCCCCAGCAGGGCCCCTGCTACACC 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149307 ACGCTGTCCTACGGCCACACCGGAGTCGGATACTGCCCCCAGCAGGGCCCCTGCTACACC 149366

Qy        661 CACTCGGGGCCTTACGGATTTCAGCCTCATCAAAGCTACGAAGTGCCCAGATACGTCCCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149367 CACTCGGGGCCTTACGGATTTCAGCCTCATCAAAGCTACGAAGTGCCCAGATACGTCCCT 149426

Qy        721 CATCCGCCCCCACCACCAACTTCTCACCAGGCAGCTCAGGCGCAGCCTCCACCCCCGGGC 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149427 CATCCGCCCCCACCACCAACTTCTCACCAGGCAGCTCAGGCGCAGCCTCCACCCCCGGGC 149486

Qy        781 ACACAGGCCCCCGAAGCCCACTGTGTGGCCGAGTCCACGATCCCTGAGGCGGGAGCAGCC 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149487 ACACAGGCCCCCGAAGCCCACTGTGTGGCCGAGTCCACGATCCCTGAGGCGGGAGCAGCC 149546

Qy        841 GGGAACTCTGGACCCCGGGAGGACACCAACCCTCAGCAGCCCACCACCGAGGGCCACCAC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149547 GGGAACTCTGGACCCCGGGAGGACACCAACCCTCAGCAGCCCACCACCGAGGGCCACCAC 149606

Qy        901 CGCGGAAAGAAACTGGTGCAGGCCTCTGCGTCCGGAGTGGCTCAGTCTAAGGAGCCCACC 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149607 CGCGGAAAGAAACTGGTGCAGGCCTCTGCGTCCGGAGTGGCTCAGTCTAAGGAGCCCACC 149666

Qy        961 ACCCCCAAGGCCAAGTCTGTGTCAGCCCACCTCAAGTCCATCTTTTGCGAGGAATTGCTG 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     149667 ACCCCCAAGGCCAAGTCTGTGTCAGCCCACCTCAAGTCCATCTTTTGCGAGGAATTGCTG 149726

Qy       1021 AATAAACGCGTGGCTTGA 1038
              ||||||||||||||||||
Db     149727 AATAAACGCGTGGCTTGA 149744



RESULT 8 

HS4B958RAJ
LOCUS       HS4B958RAJ            184113 bp    DNA     linear   VRL 12-APR-1996
DEFINITION  Epstein-Barr virus, artifactual joining of B95-8 complete genome
            and the sequences from Raji of the large deletion found in B95-8.
ACCESSION   M80517 M75989
VERSION     M80517.1  GI:330330
KEYWORDS
SOURCE      Human herpesvirus 4 (Epstein-Barr virus)
  ORGANISM  Human herpesvirus 4
            Viruses; dsDNA viruses, no RNA stage; Herpesviridae;
            Gammaherpesvirinae; Lymphocryptovirus.
REFERENCE   1  (sites)
  AUTHORS   Baer,R.J., Bankier,A.T., Biggin,M.D., Deininger,P.L., Farrell,P.J.,
            Gibson,T.J., Hatfull,G.F., Hudson,G.S., Satchwell,S.C., Seguin,C.,
            Tuffnell,P.S. and Barrell,B.G.
  TITLE     DNA sequence and expression of the B95-8 Epstein-Barr virus genome
  JOURNAL   Nature 310 (5974), 207-211 (1984)
   PUBMED   6087149
REFERENCE   2  (sites)
  AUTHORS   Parker,B.D., Bankier,A., Satchwell,S., Barrell,B. and Farrell,P.J.
  TITLE     Sequence and transcription of Raji Epstein-Barr virus DNA spanning
            the B95-8 deletion region
  JOURNAL   Virology 179 (1), 339-346 (1990)
   PUBMED   2171209
REFERENCE   3  (sites)
  AUTHORS   Sample,J., Brooks,L., Sample,C., Young,L., Rowe,M., Gregory,C.,
            Rickinson,A. and Kieff,E.
  TITLE     Restricted Epstein-Barr virus protein expression in Burkitt
            lymphoma is due to a different Epstein-Barr nuclear antigen 1
            transcriptional initiation site
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 88 (14), 6343-6347 (1991)
   PUBMED   1648738
REFERENCE   4  (bases 1 to 184113)
  AUTHORS   Jenson,H.B.
  TITLE     GenBank Curator Program
  JOURNAL   Unpublished (1992)
COMMENT     Original source text: Human herpesvirus 4 DNA.
            The B95-8 genome (V01555) has a large deletion in the right side of
            the genome which has been sequenced in Raji (M35547). These
            sequences have been joined to form an extended and more complete,
            although artifactual, EBV sequence.

            For features, refer to feature tables of V01555 and M35547.
FEATURES             Location/Qualifiers
     source          1..184113
                     /organism="Human herpesvirus 4"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:10376"
     misc_feature    1..152008
                     /note="B95-8 sequences (corresponds to 1-152,008 of
                     V01555)"
     misc_feature    152009..152012
                     /note="Overlap of B95-8 and Raji sequences at B95-8
                     deletion point (corresponds to 152,009-152,012 in V01555,
                     and 1-4 in M35547)"
     misc_feature    153013..163839
                     /note="Raji sequences (corresponds to 5-11,831 of M35547)"

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RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RX MEDLINE=84270667; PubMed-6087 1 4 9 / DOI=10 . 1038/310207a0 ; 

RA Baer R., Bankier A.T., Biggin M.D., Deininger P.L., Farrell P.J., 

RA Gibson T.J., Hatfull G., Hudson G.S., Satchwell S.C., Seguin C, 

RA Tuffnell P.S., Barrell B.G.; 

RT "DNA sequence and expression of the B95-8 Epstein-Barr virus genome."; 

RL Nature 310:207-211(1984). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] . 

RX MEDLINE=86045997; PubMed=2998075 ; 

RA Hudson G.S., Gibson T.J., Barrell B.G.; 

RT "The BamHI F region of the B95-8 Epstein-Barr virus genome."; 

RL Virology 147:99-109(1985) . 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL / V01555; CAA24838.1; ALT__INIT; GenomicJDNA. 

DR EMBL; M11923; AAA45870.1; -; GenomicJDNA. 

DR InterPro; IPR009299; Herpes_capsid. 

DR Pfam; PF06112; Herpes_capsid; 1. 

KW Capsid protein; Virion protein. 

FT CHAIN 1 176 Capsid protein VP26. 

FT /FTId=PRO_0000115739. 

SQ SEQUENCE 176 AA; 18147 MW; DAB605ED00F1A656 CRC64; 

Query Match 100.0%; Score 130; DB 1; Length 176;

Best Local Similarity 100.0%; Pred. No. 2.9e-09; 

Matches 24; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy   1 AVDTGSGGGGQPHDTAPRGARKKQ 24
       ||||||||||||||||||||||||
Db 153 AVDTGSGGGGQPHDTAPRGARKKQ 176



RESULT 2 
Q777G5_EBVG 

ID Q777G5_EBVG PRELIMINARY; PRT; 176 AA. 

AC Q777G5; 

DT 05-JUL-2004, integrated into UniProtKB/TrEMBL.

DT 05-JUL-2004, sequence version 1. 

DT 13-JUN-2006, entry version 11. 

DE Capsid protein VP26. 

GN Name=BFRF3; 

OS Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4).

OC Viruses; dsDNA viruses, no RNA stage; Herpesviridae;

OC Gammaherpesvirinae; Lymphocryptovirus.

OX NCBI_TaxID=10376;

OH NCBI_TaxID=9606; Homo sapiens (Human).

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=84270667; PubMed=6087149; DOI=10.1038/310207a0;

RA Baer R., Bankier A.T., Biggin M.D., Deininger P.L., Farrell P.J.,

RA Gibson T.J., Hatfull G., Hudson G.S., Satchwell S.C., Seguin C.,

RA Tuffnell P.S., Barrell B.G.;

RT "DNA sequence and expression of the B95-8 Epstein-Barr virus genome.";

RL Nature 310:207-211(1984).

RN [2] 

RP NUCLEOTIDE SEQUENCE. 
RC STRAIN=B95-8;

RX MEDLINE=88283646; PubMed=2840285;

RA Laux G., Perricaudet M., Farrell P.J.;

RT "A spliced Epstein-Barr virus gene expressed in immortalized

RT lymphocytes is created by circularization of the linear viral

RT genome.";

RL EMBO J. 7:769-774(1988). 

RN [3] 

RP NUCLEOTIDE SEQUENCE.

RC STRAIN=B95-8;

RX MEDLINE=82014887; PubMed=6269068;

RA Arrand J.R., Rymo L., Walsh J.E., Bjorck E., Lindahl T., Griffin B.E.;

RT "Molecular cloning of the complete Epstein-Barr virus genome as a set 

RT of overlapping restriction endonuclease fragments."; 

RL Nucleic Acids Res. 9:2999-3014(1981). 

RN [4] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=82059504; PubMed=7301588; 

RA Kozak M.;

RT "Possible role of flanking nucleotides in recognition of the AUG

RT initiator codon by eukaryotic ribosomes.";

RL Nucleic Acids Res. 9:5233-5252(1981).

RN [5] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=83109311; PubMed=6296170; 

RA Deininger P.L., Bankier A., Farrell P., Baer R., Barrell B.; 

RT "Sequence analysis and in vitro transcription of portions of the

RT Epstein-Barr virus genome.";

RL J. Cell. Biochem. 19:267-274(1982). 

RN [6] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=83169725; PubMed=6300857;

RA Farrell P.J., Deininger P.L., Bankier A., Barrell B.; 

RT "Homologous upstream sequences near Epstein-Barr virus promoters."; 

RL Proc. Natl. Acad. Sci. U.S.A. 80:1565-1569(1983).

RN [7] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=85035713; PubMed=6092825; 

RA Bankier A.T., Deininger P.L., Farrell P.J., Barrell B.G.;

RT "Sequence analysis of the 17,166 base-pair EcoRI fragment C of B95-8

RT Epstein-Barr virus."; 

RL Mol. Biol. Med. 1:21-45(1983). 

RN [8] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=85060424; PubMed=6094953;

RA Seguin C., Farrell P.J., Barrell B.G.;

RT "DNA sequence and transcription of the BamHI fragment B region of B95- 

RT 8 Epstein-Barr virus."; 

RL Mol. Biol. Med. 1:369-392(1983). 

RN [9]

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=83294686; PubMed=6310141; 

RA Jeang K.T., Hayward S.D.; 

RT "Organization of the Epstein-Barr virus DNA molecule. III. Location of 

RT the P3HR-1 deletion junction and characterization of the NotI repeat 
RT units that form part of the template for an abundant 12-O-

RT tetradecanoylphorbol-13-acetate-induced mRNA transcript."; 

RL J. Virol. 48:135-148(1983). 

RN [10] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=85060428; PubMed=6094955; 

RA Bankier A.T., Deininger P.L., Satchwell S.C., Baer R., Farrell P.J., 

RA Barrell B.G.; 

RT "DNA sequence analysis of the EcoRI Dhet fragment of B95-8 Epstein- 

RT Barr virus containing the terminal repeat sequences."; 

RL Mol. Biol. Med. 1:425-445(1983).

RN [11] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=20331131; PubMed=10872327;

RA Farrell P.J., Bankier A., Seguin C., Deininger P., Barrell B.G.;

RT "Latent and lytic cycle promoters of Epstein-Barr virus."; 

RL EMBO J. 2:1331-1338(1983). 

RN [12] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=84207939; PubMed=6327290; 

RA Jones M.D., Foster L., Sheedy T., Griffin B.E.; 

RT "The EB virus genome in Daudi Burkitt's lymphoma cells has a deletion 

RT similar to that observed in a non-transforming strain (P3HR-1) of the

RT virus.";

RL EMBO J. 3:813-821(1984).

RN [13] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=84236104; PubMed=6203743;

RA Biggin M., Farrell P.J., Barrell B.G.;

RT "Transcription and DNA sequence of the BamHI L fragment of B95-8 

RT Epstein-Barr virus."; 

RL EMBO J. 3:1083-1090(1984). 

RN [14] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=84222045; PubMed=6328526;

RA Yates J., Warren N., Reisman D., Sugden B.; 

RT "A cis-acting element from the Epstein-Barr viral genome that permits 

RT stable replication of recombinant plasmids in latently infected 

RT cells."; 

RL Proc. Natl. Acad. Sci. U.S.A. 81:3806-3810(1984).

RN [15] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RX MEDLINE=84247360; PubMed=6330697;

RA Gibson T.J., Stockwell P., Ginsburg M., Barrell B.G.;

RT "Homology between two EBV early genes and HSV ribonucleotide reductase 

RT and 38K genes."; 

RL Nucleic Acids Res. 12:5087-5099(1984). 

RN [16] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8;

RX MEDLINE=87289053; PubMed=3039467;

RA Bodescot M., Perricaudet M.;

RT "Clustered alternative splice sites in Epstein-Barr virus RNAs.";

RL Nucleic Acids Res. 15:5887-5887(1987). 

RN [17] 
RP NUCLEOTIDE SEQUENCE.

RC STRAIN=B95-8; 

RX MEDLINE=91021036; PubMed=2171209; 

RA Parker B.D., Bankier A., Satchwell S., Barrell B., Farrell P.J.;

RT "Sequence and transcription of Raji Epstein-Barr virus DNA spanning 

RT the B95-8 deletion region."; 

RL Virology 179:339-346(1990). 

RN [18] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RA Hatfull G.F., Barrell B.G., Quinn J., McGeoch D.; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases.

RN [19] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=B95-8; 

RA Binne U.K., Amon W., Farrell P.J.; 

RT "Induction of Epstein-Barr virus late promoters on small plasmids in 

RT the EBV late lytic cycle requires ori lyt."; 

RL Submitted (AUG-2002) to the EMBL/GenBank/DDBJ databases. 

CC

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AJ507799; CAD53401.1; -; Genomic_DNA. 

DR GO; GO:0019028; C:viral capsid; IEA. 

DR InterPro; IPR009299; Herpes_capsid. 

DR Pfam; PF06112; Herpes_capsid; 1. 

SQ SEQUENCE 176 AA; 18147 MW; DAB605ED00F1A656 CRC64;

Query Match 100.0%; Score 130; DB 2; Length 176;

Best Local Similarity 100.0%; Pred. No. 2.9e-09;

Matches 24; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy   1 AVDTGSGGGGQPHDTAPRGARKKQ 24
       ||||||||||||||||||||||||
Db 153 AVDTGSGGGGQPHDTAPRGARKKQ 176



RESULT 3 
Q3KSU9_EBVG 

ID Q3KSU9_EBVG PRELIMINARY; PRT; 176 AA.

AC Q3KSU9;

DT 08-NOV-2005, integrated into UniProtKB/TrEMBL.

DT 08-NOV-2005, sequence version 1.

DT 11-JUL-2006, entry version 10.

DE Hypothetical protein (BFRF3).

OS Epstein-Barr virus (strain GD1) (HHV-4) (Human herpesvirus 4).

OC Viruses; dsDNA viruses, no RNA stage; Herpesviridae;

OC Gammaherpesvirinae; Lymphocryptovirus.

OX NCBI_TaxID=10376;

OH NCBI_TaxID=9606; Homo sapiens (Human).

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=GD1;

RX PubMed=16306603; DOI=10.1128/JVI.79.24.15323-15330.2005;

RA Zeng M.-S., Li D.-J., Liu Q.-L., Song L.-B., Li M.-Z., Zhang R.-H.,

RA Yu X.-J., Wang H.-M., Ernberg I., Zeng Y.-X.;

RT "Genomic sequence analysis of Epstein-Barr virus strain GD1 from a

RT nasopharyngeal carcinoma patient."; 

RL J. Virol. 79:15323-15330(2005). 

RN [2] 
Qy  481 TGGCGGGGGACAACCCCACGACACCGCCCCACGCGGGGCACGTAAGAAACAGTAG 535
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 7130 TGGCGGGGGACAACCCCACGACACCGCCCCACGCGGGGCACGTAAGAAACAGTAG 7184



RESULT 7 

HHV507799 

LOCUS HHV507799 171823 bp DNA circular VRL 15-APR-2005
DEFINITION Human herpesvirus 4 complete wild type genome.
ACCESSION AJ507799
VERSION AJ507799.1 GI:23893576
KEYWORDS complete genome.
SOURCE Human herpesvirus 4 (Epstein-Barr virus)
ORGANISM Human herpesvirus 4
Viruses; dsDNA viruses, no RNA stage; Herpesviridae;
Gammaherpesvirinae; Lymphocryptovirus.
REFERENCE 1
AUTHORS Arrand,J.R., Rymo,L., Walsh,J.E., Bjorck,E., Lindahl,T. and
Griffin,B.E.
TITLE Molecular cloning of the complete Epstein-Barr virus genome as a
set of overlapping restriction endonuclease fragments
JOURNAL Nucleic Acids Res. 9 (13), 2999-3014 (1981)
PUBMED 6269068
REFERENCE 2
AUTHORS Kozak,M.
TITLE Possible role of flanking nucleotides in recognition of the AUG
initiator codon by eukaryotic ribosomes
JOURNAL Nucleic Acids Res. 9 (20), 5233-5252 (1981)
PUBMED 7301588
REFERENCE 3
AUTHORS Deininger,P.L., Bankier,A., Farrell,P., Baer,R. and Barrell,B.
TITLE Sequence analysis and in vitro transcription of portions of the
Epstein-Barr virus genome
JOURNAL J. Cell. Biochem. 19 (3), 267-274 (1982)
PUBMED 6296170
REFERENCE 4
AUTHORS Farrell,P.J., Bankier,A., Seguin,C., Deininger,P. and Barrell,B.G.
TITLE Latent and lytic cycle promoters of Epstein-Barr virus
JOURNAL EMBO J. 2 (8), 1331-1338 (1983)
PUBMED 10872327
REFERENCE 5
AUTHORS Farrell,P.J., Deininger,P.L., Bankier,A. and Barrell,B.
TITLE Homologous upstream sequences near Epstein-Barr virus promoters
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (6), 1565-1569 (1983)
PUBMED 6300857
REFERENCE 6
AUTHORS Bankier,A.T., Deininger,P.L., Farrell,P.J. and Barrell,B.G.
TITLE Sequence analysis of the 17,166 base-pair EcoRI fragment C of B95-8
Epstein-Barr virus
JOURNAL Mol. Biol. Med. 1 (1), 21-45 (1983)
PUBMED 6092825
REFERENCE 7
AUTHORS Seguin,C., Farrell,P.J. and Barrell,B.G.
TITLE DNA sequence and transcription of the BamHI fragment B region of
B95-8 Epstein-Barr virus
JOURNAL Mol. Biol. Med. 1 (3), 369-392 (1983)
PUBMED 6094953
REFERENCE 8
AUTHORS Jeang,K.T. and Hayward,S.D.
TITLE Organization of the Epstein-Barr virus DNA molecule. III. Location
of the P3HR-1 deletion junction and characterization of the NotI
repeat units that form part of the template for an abundant
12-O-tetradecanoylphorbol-13-acetate-induced mRNA transcript
JOURNAL J. Virol. 48 (1), 135-148 (1983)
PUBMED 6310141
REFERENCE 9
AUTHORS Bankier,A.T., Deininger,P.L., Satchwell,S.C., Baer,R., Farrell,P.J.
and Barrell,B.G.
TITLE DNA sequence analysis of the EcoRI Dhet fragment of B95-8
Epstein-Barr virus containing the terminal repeat sequences
JOURNAL Mol. Biol. Med. 1 (4), 425-445 (1983)
PUBMED 6094955
REFERENCE 10
AUTHORS Jones,M.D., Foster,L., Sheedy,T. and Griffin,B.E.
TITLE The EB virus genome in Daudi Burkitt's lymphoma cells has a
deletion similar to that observed in a non-transforming strain
(P3HR-1) of the virus
JOURNAL EMBO J. 3 (4), 813-821 (1984)
PUBMED 6327290
REFERENCE 11
AUTHORS Biggin,M., Farrell,P.J. and Barrell,B.G.
TITLE Transcription and DNA sequence of the BamHI L fragment of B95-8
Epstein-Barr virus
JOURNAL EMBO J. 3 (5), 1083-1090 (1984)
PUBMED 6203743
REFERENCE 12
AUTHORS Yates,J., Warren,N., Reisman,D. and Sugden,B.
TITLE A cis-acting element from the Epstein-Barr viral genome that
permits stable replication of recombinant plasmids in latently
infected cells
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (12), 3806-3810 (1984)
PUBMED 6328526
REFERENCE 13
AUTHORS Gibson,T., Stockwell,P., Ginsburg,M. and Barrell,B.
TITLE Homology between two EBV early genes and HSV ribonucleotide
reductase and 38K genes
JOURNAL Nucleic Acids Res. 12 (12), 5087-5099 (1984)
PUBMED 6330697
REFERENCE 14 (bases 1 to 171823)
AUTHORS Baer,R.J., Bankier,A.T., Biggin,M.D., Deininger,P.L., Farrell,P.J.,
Gibson,T.J., Hatfull,G.F., Hudson,G.S., Satchwell,S.C., Seguin,C.,
Tuffnell,P.S. and Barrell,B.G.
TITLE DNA sequence and expression of the B95-8 Epstein-Barr virus genome
JOURNAL Nature 310 (5974), 207-211 (1984)
PUBMED 6087149
REFERENCE 15
AUTHORS Bodescot,M. and Perricaudet,M.
TITLE Clustered alternative splice sites in Epstein-Barr virus RNAs
JOURNAL Nucleic Acids Res. 15 (14), 5887 (1987)
PUBMED 3039467
REFERENCE 16
AUTHORS Laux,G., Perricaudet,M. and Farrell,P.J.
TITLE A spliced Epstein-Barr virus gene expressed in immortalized
lymphocytes is created by circularization of the linear viral
genome
JOURNAL EMBO J. 7 (3), 769-774 (1988)
PUBMED 2840285
REFERENCE 17
AUTHORS Parker,B.D., Bankier,A., Satchwell,S., Barrell,B. and Farrell,P.J.
TITLE Sequence and transcription of Raji Epstein-Barr virus DNA spanning
the B95-8 deletion region
JOURNAL Virology 179 (1), 339-346 (1990)
PUBMED 2171209 
REFERENCE 18 (bases 1 to 171823)

AUTHORS Hatfull,G.F., Barrell,B.G., Quinn,J. and McGeoch,D.

JOURNAL Unpublished
REFERENCE 19 

AUTHORS Binne,U.K., Amon,W. and Farrell,P.J. 

TITLE Induction of Epstein-Barr virus late promoters on small plasmids in 

the EBV late lytic cycle requires ori lyt 
JOURNAL Unpublished 
REFERENCE 20 (bases 1 to 171823) 
AUTHORS Farrell,P.J.
TITLE Direct Submission 

JOURNAL Submitted (01-AUG-2002) Farrell P., Ludwig Institute for Cancer
Research, Imperial College School of Medicine, St. Mary's Campus,
Norfolk Place London W2 1PG
COMMENT Construction:

This sequence was assembled from B95-8 EBV [14] and Raji EBV [18]
with sequence corrections [16, 19]. The number of major internal
repeat units has been reduced from 11.6 [14] to a more typical 7.6
and the B95-8 deletion sequences have been restored to give a
sequence more representative of wild type EBV.
Numbering

Like the modified B95-8 sequence [14, 16] accession number V01555,
this sequence starts 1 base to the left of the EcoRI site
separating EcoRI Dhet from EcoRI i (ie the first A of AGAATTC.).
FEATURES Location/Qualifiers
source 1..171823

/organism="Human herpesvirus 4"
/mol_type="genomic DNA"
/strain="B95-8"
/db_xref="taxon:10376"
source 139724..151554

/organism="Human herpesvirus 4"
/mol_type="genomic DNA"
/strain="Raji"
/db_xref="taxon:10376"
CDS join(166103..166458,58..272,360..458,540..788,871..951,
1026..1196,1280..1495,1574..1680)
/codon_start=1

/product="terminal protein LMP2A"
/protein_id="CAD53382.1"
/db_xref="GI:23893577"
/db_xref="InterPro:IPR010881"
/db_xref="UniProt/TrEMBL:Q777H4"

/translation="MGSLEMVPMGAGPPSPGGDPDGYDGGNNSQYPSASGSSGNTPTP
PNDEERESNEEPPPPYEDPYWGNGDRHSDYQPLGTQDQSLYLGLQHDGNDGLPPPPYS 
PRDDSSQHIYEEAGRGSMNPVCLPVIVAPYLFWLAAIAASCFTASVSTVVTATGLALS
LLLLAAVASSYAAAQRKLLTPVTVLTAVVTFFAICLTWRIEDPPFNSLLFALLAAAGG 
LQGIYVLVMLVLLILAYRRRWRRLTVCGGIMFLACVLVLIVDAVLQLSPLLGAVTVVS 
MTLLLLAFVLWLSSPGGLGTLGAALLTLAAALALLASLILGTLNLTTMFLLMLLWTLV 
VLLICSSCSSCPLSKILLARLFLYALALLLLASALIAGGSILQTNFKSLSSTEFIPNL 
FCMLLLIVAGILFILAILTEWGSGNRTYGPVFMCLGGLLTMVAGAVWLTVMSNTLLSA 
WILTAGFLIFLIGFALFGVIRCCRYCCYYCLTLESEERPPTPYRNTV" 

gene join(58..272,360..458,540..788,871..951,1026..1196,
1280..1495,1574..1682,5408..5856,166103..166458)
/gene="LMP2"

exon 58..272

/gene="LMP2"
/number=2

CDS join(59..272,360..458,540..788,871..951,1026..1196,
1280..1495,1574..1680)
/gene="LMP2"

/codon_start=1

/product="terminal protein LMP2B"
/protein_id="CAD53383.1"
/db_xref="GI:23893578"
/db_xref="InterPro:IPR010881"
/db_xref="UniProt/TrEMBL:Q8AZK9"

/translation="MNPVCLPVIVAPYLFWLAAIAASCFTASVSTVVTATGLALSLLL
LAAVASSYAAAQRKLLTPVTVLTAVVTFFAICLTWRIEDPPFNSLLFALLAAAGGLQG 
IYVLVMLVLLILAYRRRWRRLTVCGGIMFLACVLVLIVDAVLQLSPLLGAVTVVSMTL 
LLLAFVLWLSSPGGLGTLGAALLTLAAALALLASLILGTLNLTTMFLLMLLWTLVVLL 
ICSSCSSCPLSKILLARLFLYALALLLLASALIAGGSILQTNFKSLSSTEFIPNLFCM 
LLLIVAGILFILAILTEWGSGNRTYGPVFMCLGGLLTMVAGAVWLTVMSNTLLSAWIL 
TAGFLIFLIGFALFGVIRCCRYCCYYCLTLESEERPPTPYRNTV" 

exon 360..458

/gene="LMP2"
/number=3

exon 540..788

/gene="LMP2"
/number=4

exon 871..951

/gene="LMP2"
/number=5

exon 1026..1196

Query Match 98.8%; Score 531.8; DB 13; Length 171823; 

Best Local Similarity 99.6%; Pred. No. 2.5e-92; 

Matches 533; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy     1 CATGATGGCACGCCGGCTGCCCAAGCCCACCCTCCAGGGGAGGCTGGAGGCGGATTTTCC 60
         | | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49215 CGTTATGGCACGCCGGCTGCCCAAGCCCACCCTCCAGGGGAGGCTGGAGGCGGATTTTCC 49274

Qy    61 AGACAGTCCCCTGCTTCCTAAATTTCAAGAGCTGAACCAGAATAATCTCCCCAATGATGT 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49275 AGACAGTCCCCTGCTTCCTAAATTTCAAGAGCTGAACCAGAATAATCTCCCCAATGATGT 49334

Qy   121 TTTTCGGGAGGCTCAAAGAAGTTACCTGGTATTTCTGACATCCCAGTTCTGCTACGAAGA 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49335 TTTTCGGGAGGCTCAAAGAAGTTACCTGGTATTTCTGACATCCCAGTTCTGCTACGAAGA 49394

Qy   181 GTACGTGCAGAGGACTTTTGGGGTGCCTCGGCGCCAACGCGCCATAGACAAGAGGCAGAG 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49395 GTACGTGCAGAGGACTTTTGGGGTGCCTCGGCGCCAACGCGCCATAGACAAGAGGCAGAG 49454

Qy   241 AGCCAGTGTGGCTGGGGCTGGTGCTCATGCACACCTTGGCGGGTCATCCGCCACCCCCGT 300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49455 AGCCAGTGTGGCTGGGGCTGGTGCTCATGCACACCTTGGCGGGTCATCCGCCACCCCCGT 49514

Qy   301 CCAGCAGGCTCAGGCCGCCGCATCCGCTGGGACCGGGGCCTTGGCATCATCAGCGCCGTC 360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49515 CCAGCAGGCTCAGGCCGCCGCATCCGCTGGGACCGGGGCCTTGGCATCATCAGCGCCGTC 49574

Qy   361 CACGGCCGTAGCCCAGTCCGCGACCCCCTCTGTTTCTTCATCTATTAGCAGCCTCCGGGC 420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49575 CACGGCCGTAGCCCAGTCCGCGACCCCCTCTGTTTCTTCATCTATTAGCAGCCTCCGGGC 49634

Qy   421 CGCGACTTCGGGGGCGACTGCCGCCGCCTCCGCCGCCGCAGCCGTCGATACCGGGTCAGG 480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49635 CGCGACTTCGGGGGCGACTGCCGCCGCCTCCGCCGCCGCAGCCGTCGATACCGGGTCAGG 49694

Qy   481 TGGCGGGGGACAACCCCACGACACCGCCCCACGCGGGGCACGTAAGAAACAGTAG 535
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 49695 TGGCGGGGGACAACCCCACGACACCGCCCCACGCGGGGCACGTAAGAAACAGTAG 49749
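The summary line for this result reports 533 matches and 2 mismatches over 535 aligned positions, with a Best Local Similarity of 99.6%. The sketch below shows one way that figure can be reproduced. It assumes the percentage is simply the number of identical columns divided by the alignment length, which is consistent with the numbers reported here but is not taken from the search tool's documentation. The function name `local_similarity` is hypothetical.

```python
def local_similarity(query, subject):
    """Return (matches, percent identity) for two equal-length aligned strings.

    Assumption: similarity = identical columns / alignment length. Gap
    characters ("-") never count as matches.
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have the same length")
    if not query:
        raise ValueError("aligned strings must not be empty")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return matches, 100.0 * matches / len(query)


if __name__ == "__main__":
    # First eight columns of the alignment above: two mismatches, six matches.
    print(local_similarity("CATGATGG", "CGTTATGG"))
    # Whole alignment, using the counts from the summary line.
    print(round(100.0 * 533 / 535, 1))
```

Under this assumption the two mismatched columns both fall within the first four bases of the query, and every other column is identical.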



RESULT 8 

EBV 

LOCUS EBV 172281 bp DNA circular VRL 18-APR-2005
DEFINITION Epstein-Barr virus (EBV) genome, strain B95-8.
ACCESSION V01555 J02070 K01729 K01730 V01554 X00498 X00499 X00784
VERSION V01555.1 GI:59074
KEYWORDS DNA polymerase; EBNA; genome; ribonucleotide reductase; tandem
repeat; terminal repeat.
SOURCE Human herpesvirus 4 (Epstein-Barr virus)
ORGANISM Human herpesvirus 4
Viruses; dsDNA viruses, no RNA stage; Herpesviridae;
Gammaherpesvirinae; Lymphocryptovirus.
REFERENCE 1
AUTHORS Arrand,J.R., Rymo,L., Walsh,J.E., Bjorck,E., Lindahl,T. and
Griffin,B.E.
TITLE Molecular cloning of the complete Epstein-Barr virus genome as a
set of overlapping restriction endonuclease fragments
JOURNAL Nucleic Acids Res. 9 (13), 2999-3014 (1981)
PUBMED 6269068
REFERENCE 2
AUTHORS Kozak,M.
TITLE Possible role of flanking nucleotides in recognition of the AUG
initiator codon by eukaryotic ribosomes
JOURNAL Nucleic Acids Res. 9 (20), 5233-5252 (1981)
PUBMED 7301588
REFERENCE 3
AUTHORS Deininger,P.L., Bankier,A., Farrell,P., Baer,R. and Barrell,B.
TITLE Sequence analysis and in vitro transcription of portions of the
Epstein-Barr virus genome
JOURNAL J. Cell. Biochem. 19 (3), 267-274 (1982)
PUBMED 6296170
REFERENCE 4
AUTHORS Farrell,P.J., Bankier,A., Seguin,C., Deininger,P. and Barrell,B.G.
TITLE Latent and lytic cycle promoters of Epstein-Barr virus
JOURNAL EMBO J. 2 (8), 1331-1338 (1983)
PUBMED 10872327
REFERENCE 5
AUTHORS Farrell,P.J., Deininger,P.L., Bankier,A. and Barrell,B.
TITLE Homologous upstream sequences near Epstein-Barr virus promoters
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (6), 1565-1569 (1983)
PUBMED 6300857
REFERENCE 6 (bases 142687 to 159853)
AUTHORS Bankier,A.T., Deininger,P.L., Farrell,P.J. and Barrell,B.G.
TITLE Sequence analysis of the 17,166 base-pair EcoRI fragment C of B95-8
Epstein-Barr virus
JOURNAL Mol. Biol. Med. 1 (1), 21-45 (1983)
PUBMED 6092825
REFERENCE 7 (bases 112620 to 125316)
AUTHORS Seguin,C., Farrell,P.J. and Barrell,B.G.
TITLE DNA sequence and transcription of the BamHI fragment B region of
B95-8 Epstein-Barr virus
JOURNAL Mol. Biol. Med. 1 (3), 369-392 (1983)
PUBMED 6094953
REFERENCE 8 (bases 45644 to 52450)
AUTHORS Jeang,K.T. and Hayward,S.D.